Search results for "Audio signal"

showing 10 items of 30 documents

2015

Visuo-auditory sensory substitution systems are augmented reality devices that translate a video stream into an audio stream in order to help the blind in daily tasks requiring visuo-spatial information. In this work, we present both a new mobile device and a transcoding method specifically designed to sonify moving objects. Frame differencing is used to extract spatial features from the video stream and two-dimensional spatial information is converted into audio cues using pitch, interaural time difference and interaural level difference. Using numerical methods, we attempt to reconstruct visuo-spatial information based on audio signals generated from various video stimuli. We show that de…

Audio signalComputer Networks and Communicationsbusiness.industryComputer scienceSpeech recognitionMotion detectionTranscodingAudio signal flowVideo processingcomputer.software_genreSensory substitutionArtificial IntelligenceHardware and ArchitectureSonificationComputer visionArtificial intelligencebusinessAudio signal processingcomputerSoftwareInformation SystemsFrontiers in ICT
researchProduct

Real-time signal processing in embedded systems

2016

International audience

010302 applied physics[ INFO ] Computer Science [cs]business.industryComputer science020206 networking & telecommunications02 engineering and technologycomputer.software_genre01 natural sciencesSignalHardware and Architecture0103 physical sciences0202 electrical engineering electronic engineering information engineeringReal time signal processing[INFO]Computer Science [cs]businessAudio signal processingcomputerSoftwareDigital signal processingComputer hardwareComputingMilieux_MISCELLANEOUS
researchProduct

A Comparative Analysis of Residual Block Alternatives for End-to-End Audio Classification

2020

Residual learning is known for being a learning framework that facilitates the training of very deep neural networks. Residual blocks or units are made up of a set of stacked layers, where the inputs are added back to their outputs with the aim of creating identity mappings. In practice, such identity mappings are accomplished by means of the so-called skip or shortcut connections. However, multiple implementation alternatives arise with respect to where such skip connections are applied within the set of stacked layers making up a residual block. While residual networks for image classification using convolutional neural networks (CNNs) have been widely discussed in the literature, their a…

Normalization (statistics)General Computer ScienceComputer scienceFeature extractionESC02 engineering and technologycomputer.software_genreResidualConvolutional neural networkconvolutional neural networks0202 electrical engineering electronic engineering information engineeringGeneral Materials Scienceurbansound8kAudio signal processingBlock (data storage)Contextual image classificationGeneral EngineeringAudio classification020206 networking & telecommunications113 Computer and information sciences020201 artificial intelligence & image processinglcsh:Electrical engineering. Electronics. Nuclear engineeringData mininglcsh:TK1-9971computerresidual learningIEEE Access
researchProduct

Self-Organizing Architectures for Digital Signal Processing

2013

Self-organizationSettore ING-INF/05 - Sistemi Di Elaborazione Delle InformazioniComputer sciencebusiness.industrycomputer.software_genreSignalDigital image processingDigital Signal ProcessingDigital signalbusinessAudio signal processingcomputerDigital signal processingComputer hardwareComputer Architectures
researchProduct

Capturing and Indexing Rehearsals: The Design and Usage of a Digital Archive of Performing Arts

2015

International audience; Preserving the cultural heritage of the performing arts raises difficult and sensitive issues, as each performance is unique by nature and the juxtaposition between the performers and the audience cannot be easily recorded. In this paper, we report on an experimental research project to preserve another aspect of the performing arts—the history of their rehearsals. We have specifically designed non-intrusive video recording and on-site documentation techniques to make this process transparent to the creative crew, and have developed a complete workflow to publish the recorded video data and their corresponding meta-data online as Open Data using state-of-the-art audi…

Digital archivingComputer science[ INFO.INFO-WB ] Computer Science [cs]/Web02 engineering and technology[ INFO.INFO-CV ] Computer Science [cs]/Computer Vision and Pattern Recognition [cs.CV]computer.software_genre[SHS.MUSEO]Humanities and Social Sciences/Cultural heritage and museologyvideo processingWorld Wide WebDocumentationopera11. Sustainability0202 electrical engineering electronic engineering information engineeringAudio signal processing[ INFO.INFO-MM ] Computer Science [cs]/Multimedia [cs.MM]HypervideoMultimediahypervideo[INFO.INFO-WB]Computer Science [cs]/Web[INFO.INFO-MM]Computer Science [cs]/Multimedia [cs.MM][INFO.INFO-CV]Computer Science [cs]/Computer Vision and Pattern Recognition [cs.CV]020207 software engineering[ MATH.MATH-NA ] Mathematics [math]/Numerical Analysis [math.NA]Video processingLinked dataperforming artsaudio processingCultural heritageWorkflowtheaterLinked Data[ SHS.MUSEO ] Humanities and Social Sciences/Cultural heritage and museology020201 artificial intelligence & image processingPerforming artscomputer[MATH.MATH-NA]Mathematics [math]/Numerical Analysis [math.NA]
researchProduct

On the relations between audio features and room acoustic parameters of auralizations

2013

The usual parameters in room acoustics are used to quantify the acoustic characteristics of rooms and their relation to the subjective perception of transmitted signals. Audio features (calculated with MIRToolbox) have been designed to study the relationships between the characteristics of musical audio files and their subjective perception. Both musical characteristics and acoustic parameters are oriented towards acoustic perception. By using auralizations with calibrated models of auditoriums and tools from the MIRtoolbox it is possible to jointly work with the calculation of audio features and room parameters. In this work, the statistical correlations between C80, STI, D50, EDT, RT and …

EngineeringSignalsAudio signalRelation (database)business.industrymedia_common.quotation_subjectSubjective perceptionAcousticsGeneral EngineeringAcousticsRoom acousticsPearson product-moment correlation coefficientsymbols.namesakePerceptionFISICA APLICADAsymbolsbusinessMATEMATICA APLICADAmedia_common
researchProduct

Modeling musical attributes to characterize ensemble recordings using rhythmic audio features

2011

In this paper, we present the results of a pre-study on music performance analysis of ensemble music. Our aim is to implement a music classification system for the description of live recordings, for instance to help musicologist and musicians to analyze improvised ensemble performances. The main problem we deal with is the extraction of a suitable set of audio features from the recorded instrument tracks. Our approach is to extract rhythm-related audio features and to apply them for regression-based modeling of eight more general musical attributes. The model based on Partial Least-Squares Regression without preceding Principal Component Analysis performed best for all of the eight attribu…

Set (abstract data type)Sound recording and reproductionMusicologyComputer scienceSpeech recognitionFeature extractionMusicalAudio signal processingcomputer.software_genrecomputer2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
researchProduct

Spectral estimators for Doppler analysis of intracoronary ultrasound catheters

2002

With the zero-cross-detection method (ZCD) it has been shown that it is not possible to achieve a reproducible quantitative, and robust evaluation of an inter-coronary audio signal. The authors define spectral estimators to analyze the Doppler-audio signal. Measurements in a blood flow model have shown that the ZCD method underestimates the expected velocity at all speeds. Spectral analysis allows the determination of the actual and peak velocity more robustly and precisely. >

Signal processingsymbols.namesakeAudio signalPeak velocityComputer scienceRobustness (computer science)AcousticssymbolsEstimatorSpectral analysisBlood flowDoppler effect[1991] Proceedings Computers in Cardiology
researchProduct

Decoding Children's Social Behavior

2013

We introduce a new problem domain for activity recognition: the analysis of children's social and communicative behaviors based on video and audio data. We specifically target interactions between children aged 1-2 years and an adult. Such interactions arise naturally in the diagnosis and treatment of developmental disorders such as autism. We introduce a new publicly-available dataset containing over 160 sessions of a 3-5 minute child-adult interaction. In each session, the adult examiner followed a semi-structured play interaction protocol which was designed to elicit a broad range of social behaviors. We identify the key technical challenges in analyzing these behaviors, and describe met…

Behavior Psychology Dataset Video analysis Speech Analysis AutismInter-action protocolsSocial and communicative behaviorInteraction protocol02 engineering and technologycomputer.software_genreAnnan data- och informationsvetenskapSession (web analytics)Activity recognitionTechnical challenges0202 electrical engineering electronic engineering information engineeringmedicineSocial behaviorAudio signal processingMultimediabusiness.industryDevelopmental disorders020207 software engineeringmedicine.diseaseSemi-structuredResearch questionsActivity recognitionProblem domainKey (cryptography)Autism020201 artificial intelligence & image processingArtificial intelligencePsychologybusinessOther Computer and Information SciencecomputerCognitive psychologySocial behavior2013 IEEE Conference on Computer Vision and Pattern Recognition
researchProduct

Video preprocessing for audiovisual indexing

2003

We address the problem of detecting shots of subjects that are interviewed in news sequences. This is useful since usually these kinds of scenes contain important and reusable information that can be used for other news programs. In a previous paper, we presented a technique based on a priori knowledge of the editing techniques used in news sequences which allowed a fast search of news stories (see Albiol, A. et al., 3rd Int. Conf. on Audio and Video-based Biometric Person Authentication, p.366-71, 2001). We now present a new shot descriptor technique which improves the previous search results by using a simple, yet efficient, algorithm, based on the information contained in consecutive fra…

AuthenticationSequenceInformation retrievalContextual image classificationBiometricsComputer scienceSpeech recognitionSearch engine indexingcomputer.software_genreObject detectionReduction (complexity)Face (geometry)PreprocessorAudio signal processingcomputerImage retrievalIEEE International Conference on Acoustics Speech and Signal Processing
researchProduct